Corpus: fra-ci_web_2015_100K

5.1.18 Words nearly always as next neighbors

Strong next-neighbor (NN) co-occurrences with a low probability of being separated

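The quotient below is calculated as freq(word1)*freq(word2)/NN_freq^2. Because a pair cannot occur as next neighbors more often than either word occurs on its own, the quotient is never below 1; a value close to 1 means the two words appear almost exclusively together.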
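As a minimal sketch, the quotient can be recomputed from the three frequency columns. This assumes the numerator is the product of the two individual word frequencies, which reproduces the values listed in the table. The function name `nn_quotient` and the `rows` list are illustrative only and are not part of the corpus tooling.

```python
def nn_quotient(freq_word1: int, freq_word2: int, nn_freq: int) -> float:
    """Return freq(word1) * freq(word2) / NN_freq^2 (assumed formula)."""
    if nn_freq <= 0:
        raise ValueError("nn_freq must be positive")
    return (freq_word1 * freq_word2) / (nn_freq ** 2)


# A few rows copied from the table:
# (word 1, word 2, frequency of word 1, frequency of word 2, frequency as NN)
rows = [
    ("Veuillez", "sélectionner", 345, 335, 310),
    ("Burkina", "Faso", 208, 189, 167),
    ("St", "ValenTIC", 123, 106, 106),
]

for word1, word2, freq1, freq2, nn_freq in rows:
    # Two decimals, matching the precision used in the table.
    print(f"{word1} {word2}: {nn_quotient(freq1, freq2, nn_freq):.2f}")
# Veuillez sélectionner: 1.20
# Burkina Faso: 1.41
# St ValenTIC: 1.16
```

A pair whose words never occur apart (both frequencies equal to the NN frequency) yields exactly 1.0, the lower bound of the measure.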

Word 1        Word 2          Frequency of word 1  Frequency of word 2  Frequency as NN  Quotient
Veuillez      sélectionner    345                  335                  310              1.20
Daniel        Kablan          330                  289                  262              1.39
comparer      Vendeur         317                  275                  260              1.29
GMT           Connection      243                  218                  206              1.25
Burkina       Faso            208                  189                  167              1.41
Blé           Goudé           200                  168                  163              1.26
St            ValenTIC        123                  106                  106              1.16
San           Pedro           121                  101                  93               1.41
Sol           Béni            108                  98                   96               1.15
Assurez-vous  d'indiquer      75                   79                   64               1.45
Sierra        Leone           76                   72                   67               1.22
Imprimer      Partager        49                   59                   46               1.37
Listing       ID              38                   51                   38               1.34
Fraternité    Matin           51                   45                   43               1.24
New           York            61                   45                   44               1.42
Webb          Fontaine        28                   33                   28               1.18
Charlie       Hebdo           39                   32                   32               1.22
Nelson        Mandela         27                   32                   25               1.38
pharmacies    conventionnées  38                   30                   28               1.45
are           no              25                   30                   24               1.30